02:16
2026-06-29
owls.baulab.info
large-language-models
Token Entanglement in Subliminal Learning
Researchers at Anthropic discovered that language models can transfer hidden behaviors through fine-tuning on seemingly meaningless data, a phenomenon called subliminal learning. They identified 'entaβ¦